Learning a Reversi Board Evaluator with Minimax
نویسنده
چکیده
A board position evaluator is a crucial component for strong computer play in many games such as checkers, chess, and Reversi. The board evaluator is typically trained using pre-existing game data, an approach which is generally non-optimal, especially in the early stages of a game. Instead, we propose a new method which relies on Minimax search to train a series of models backwards from the endgame, propagating information about endgame scoring backwards to earlier positions. In the limit of perfect models, our method is optimal, converging to the full Minimax board evaluation. We test our method experimentally by training a simple Reversi model using both our Minimax method and high-level tournament game data. When played against each other, our model outperforms the game data model, averaging 5 more stones per game, and winning approximately 60% of the matches.
منابع مشابه
Using an Evaluator Fixed Structure Learning Automata in Sampling of Social Networks
Social networks are streaming, diverse and include a wide range of edges so that continuously evolves over time and formed by the activities among users (such as tweets, emails, etc.), where each activity among its users, adds an edge to the network graph. Despite their popularities, the dynamicity and large size of most social networks make it difficult or impossible to study the entire networ...
متن کاملRion Snow
Here we present the algorithms and extensions that produced our successful Othello-playing program, which finished among the top five programs of this year’s Othello tournament. Specifically, we discuss a variety of search algorithms and extensions, as well as our algorithms for adaptive timemanagement, our efficient ‘bit board’ data structure, our features comprising our evaluation function, o...
متن کاملMachine Learning of Othello Heuristics
The machine learning algorithm of [3] is applied to the problem of learning which heuristics to apply when playing the board game Othello. The problem is large, for there are 46,875 heuristics considered. The results are respectable; the Learner is able to beat a practiced human player approximately fifty percent of the time. Suggestions for improvement are included.
متن کاملParallel Minimax Tree Searching on GPU
The paper describes results of minimax tree searching algorithm implemented within CUDA platform. The problem regards move choice strategy in the game of Reversi. The parallelization scheme and performance aspects are discussed, focusing mainly on warp divergence problem and data transfer size. Moreover, a method of minimizing warp divergence and performance degradation is described. The paper ...
متن کاملAn Intelligent Othello Player Combining Machine Learning and Game Specific Heuristics
In this paper we present an intelligent Othello game player that combines game-specific heuristics with machine learning techniques for move selection. Five game specific heuristics have been proposed; some of which can be generalized to fit other games. For machine learning techniques, the normal Minimax algorithm along with a custom variation is used as a base. Genetic algorithms and neural n...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015